ML-Evaluation of Classifiers

Evaluation Criteria

Predictive accuracy:

$$\text{Accuracy} = \frac{\text{Number of correct classifications}}{\text{Total number of test cases}}$$
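
A minimal sketch of this computation, assuming the true and predicted labels are available as two equal-length Python lists (the labels in the example call are purely illustrative):

```python
# Minimal sketch: accuracy = number of correct classifications / total number of test cases.
def accuracy(y_true, y_pred):
    correct = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    return correct / len(y_true)

# Illustrative labels: 3 of the 4 predictions match, so accuracy is 0.75.
print(accuracy(["pos", "neg", "pos", "neg"], ["pos", "pos", "pos", "neg"]))
```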

Efficiency

  • Time to construct the model
  • Time to use the model

Robustness: handling noise and missing values

Scalability: efficiency on disk-resident databases

Interpretability: understandability of, and insight provided by, the model

Compactness of the model: size of the tree, or the number of rules.

Precision and Recall Measures

Confusion Matrix

|                 | Classified Positive | Classified Negative |
|-----------------|---------------------|---------------------|
| Actual Positive | TP                  | FN                  |
| Actual Negative | FP                  | TN                  |
$$\text{precision} = \frac{TP}{TP + FP} \qquad \text{recall} = \frac{TP}{TP + FN} \qquad F_1 = \frac{2 \times \text{precision} \times \text{recall}}{\text{precision} + \text{recall}}$$
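
A minimal sketch of these three measures computed directly from the confusion-matrix counts; the counts in the example call are made up:

```python
# Minimal sketch: precision, recall and F1 from confusion-matrix counts
# for the positive class (TP, FP, FN).
def precision_recall_f1(tp, fp, fn):
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (2 * precision * recall / (precision + recall)
          if (precision + recall) else 0.0)
    return precision, recall, f1

# Made-up counts: precision = 0.8, recall ≈ 0.667, F1 ≈ 0.727.
print(precision_recall_f1(tp=40, fp=10, fn=20))
```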

ROC Curve

True positive rate:

$$TPR = \frac{TP}{TP + FN}$$

False positive rate (equal to 1 minus the true negative rate):

$$FPR = \frac{FP}{TN + FP}$$

How do we compare two ROC curves? Compute the area under the curve (AUC).

If the AUC of classifier Ci is greater than that of Cj, Ci is said to be better than Cj. A perfect classifier has an AUC of 1, and a classifier that makes only random guesses has an AUC of 0.5.
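
As a sketch of how the ROC points and the AUC can be computed, the snippet below sweeps a decision threshold over classifier scores, records the (FPR, TPR) points, and integrates them with the trapezoidal rule. The labels and scores in the example are illustrative, and tied scores are not handled specially:

```python
# Minimal sketch: ROC points and AUC from classifier scores.
# y_true holds 1 (positive) / 0 (negative); scores are the classifier's
# confidence for the positive class.
def roc_auc(y_true, scores):
    # Sort examples by decreasing score and sweep the threshold down the list.
    order = sorted(range(len(scores)), key=lambda i: -scores[i])
    pos = sum(y_true)
    neg = len(y_true) - pos
    tp = fp = 0
    points = [(0.0, 0.0)]  # (FPR, TPR) pairs, starting at the origin
    for i in order:
        if y_true[i] == 1:
            tp += 1
        else:
            fp += 1
        points.append((fp / neg, tp / pos))
    # Trapezoidal rule over the ROC points gives the area under the curve.
    auc = sum((x2 - x1) * (y1 + y2) / 2
              for (x1, y1), (x2, y2) in zip(points, points[1:]))
    return points, auc

# Illustrative labels and scores: AUC is 8/9 ≈ 0.889.
_, auc = roc_auc([1, 1, 0, 1, 0, 0], [0.9, 0.8, 0.7, 0.6, 0.4, 0.2])
print(auc)
```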

Evaluation Methods

Holdout set: The available data set D is divided into two disjoint subsets,

  • the training set Dtrain (for learning a model)
  • the test set Dtest (for testing the model)

Important: training set should not be used in testing and the test set should not be used in learning.

  • An unseen test set provides an unbiased estimate of accuracy.

The test set is also called the holdout set. (The examples in the original data set D are all labeled with classes.)

This method is mainly used when the data set D is large.
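
A minimal sketch of a holdout split, assuming D is a list of labeled examples; the 70/30 ratio and the helper name are illustrative choices, not fixed by the method:

```python
import random

# Minimal sketch: split a labeled data set D into disjoint training and test subsets.
def holdout_split(D, test_fraction=0.3, seed=0):
    examples = list(D)
    random.Random(seed).shuffle(examples)
    n_test = int(len(examples) * test_fraction)
    return examples[n_test:], examples[:n_test]  # (D_train, D_test)

D = [(x, x % 2) for x in range(100)]  # toy labeled examples (feature, class)
D_train, D_test = holdout_split(D)
print(len(D_train), len(D_test))  # 70 30
```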

n-fold cross-validation:

The available data is partitioned into n equal-size disjoint subsets. Each subset is used in turn as the test set, and the remaining n − 1 subsets are combined as the training set to learn a classifier.

The procedure is run n times, giving n accuracies.

The final estimated accuracy of learning is the average of the n accuracies.

10-fold and 5-fold cross-validations are commonly used.

This method is used when the available data is not large.
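
A minimal sketch of the n-fold procedure, assuming D is a list of labeled examples; the majority-class learner below is only a placeholder for whatever classifier is actually being evaluated:

```python
import random

# Minimal sketch: n-fold cross-validation returning the average accuracy.
def cross_validation(D, n_folds, train, evaluate, seed=0):
    examples = list(D)
    random.Random(seed).shuffle(examples)
    folds = [examples[i::n_folds] for i in range(n_folds)]  # n disjoint subsets
    accuracies = []
    for i in range(n_folds):
        test_fold = folds[i]
        train_set = [ex for j, fold in enumerate(folds) if j != i for ex in fold]
        model = train(train_set)
        accuracies.append(evaluate(model, test_fold))
    return sum(accuracies) / n_folds  # average of the n accuracies

# Placeholder learner: always predict the most frequent class in the training set.
def majority_class(examples):
    labels = [y for _, y in examples]
    return max(set(labels), key=labels.count)

def evaluate(model, examples):
    return sum(1 for _, y in examples if y == model) / len(examples)

D = [(x, 0 if x < 60 else 1) for x in range(100)]  # toy labeled examples
print(cross_validation(D, 10, majority_class, evaluate))
```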